Failures of the One-Step Learning Algorithm

Author

David J.C. MacKay

Abstract

The Hinton network (Hinton, 2001, personal communication) is a deterministic mapping from an observable space $x$ to an energy function $E(x;w)$, parameterized by parameters $w$. The energy defines a probability $P(x|w) = \exp(-E(x;w))/Z(w)$. A maximum likelihood learning algorithm for this density model takes steps $\Delta w \propto -\langle g \rangle_0 + \langle g \rangle_\infty$, where $\langle g \rangle_0$ is the average of the gradient $g = \partial E/\partial w$ evaluated at points $x$ drawn from the data density, and $\langle g \rangle_\infty$ is the average gradient for points $x$ drawn from $P(x|w)$. If $T$ is a Markov chain in $x$-space that has $P(x|w)$ as its unique invariant density, then we can approximate $\langle g \rangle_\infty$ by taking the data points $x$ and hitting each of them $I$ times with $T$, where $I$ is a large integer. In the one-step learning algorithm of Hinton (2001), we set $I$ to 1. In this paper I give examples of models $E(x;w)$ and Markov chains $T$ for which the true likelihood is unimodal in the parameters, but the one-step algorithm does not necessarily converge to the maximum likelihood parameters. It is hoped that these negative examples will help pin down the conditions for the one-step algorithm to be a correctly convergent algorithm.

The Hinton network (Hinton, 2001, personal communication) is a deterministic mapping from an observable space $x$ of dimension $D$ to an energy function $E(x;w)$, parameterized by parameters $w$. The energy defines a probability

$$P(x|w) = \frac{\exp(-E(x;w))}{Z(w)}, \tag{1}$$

where

$$Z(w) = \int dx \, \exp(-E(x;w)) \tag{2}$$

is the hard-to-evaluate normalizing constant or partition function. A maximum likelihood learning algorithm for this density model takes steps

$$\Delta w \propto -\langle g \rangle_0 + \langle g \rangle_\infty, \tag{3}$$

where $\langle g \rangle_0$ is the average of the gradient $g = \partial E/\partial w$ evaluated at points $x$ drawn from the data density, and $\langle g \rangle_\infty$ is the average gradient for points $x$ drawn from $P(x|w)$. If $T$ is a Markov chain in $x$-space that has $P(x|w)$ as its unique invariant density, then we can approximate $\langle g \rangle_\infty$ by taking the data points $x$ and hitting each of them $I$ times with $T$, where $I$ ...
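To make the update rule concrete, here is a minimal sketch of the $I$-step procedure in Python. Everything specific in it is an assumption chosen for illustration and is not taken from the paper: the toy energy $E(x;w) = \tfrac{1}{2}(x-w)^2$ (so $P(x|w)$ is a unit-variance Gaussian with mean $w$), the autoregressive chain used as $T$, the mixing constant `a`, the learning rate `lr`, and the function names. Setting `n_steps=1` gives the one-step variant.

```python
import math
import random


def energy_grad(x, w):
    """g = dE/dw for the assumed toy energy E(x; w) = 0.5 * (x - w)**2."""
    return w - x


def markov_step(x, w, rng, a=0.5):
    """One application of T: an AR(1) move that leaves N(w, 1) invariant.

    If x ~ N(w, 1), the result has mean w and variance a^2 + (1 - a^2) = 1.
    """
    return w + a * (x - w) + math.sqrt(1.0 - a * a) * rng.gauss(0.0, 1.0)


def mean(values):
    return sum(values) / len(values)


def i_step_update(data, w, rng, n_steps=1, lr=0.1):
    """Return w after one step  dw = lr * (-<g>_0 + <g>_I)."""
    # Start the chain at the data points and apply T n_steps times.
    samples = list(data)
    for _ in range(n_steps):
        samples = [markov_step(x, w, rng) for x in samples]
    g_data = mean([energy_grad(x, w) for x in data])      # <g>_0
    g_model = mean([energy_grad(x, w) for x in samples])  # stands in for <g>_inf
    return w + lr * (-g_data + g_model)


def fit(data, w0=0.0, n_steps=1, n_iters=500, lr=0.1, seed=0):
    """Run the I-step learning rule for a fixed number of iterations."""
    rng = random.Random(seed)
    w = w0
    for _ in range(n_iters):
        w = i_step_update(data, w, rng, n_steps, lr)
    return w


if __name__ == "__main__":
    rng = random.Random(1)
    data = [rng.gauss(2.0, 1.0) for _ in range(1000)]
    print("sample mean:", mean(data))
    print("one-step estimate:", fit(data, n_steps=1))
```

For this particular toy model the maximum likelihood value of $w$ is the sample mean, and the expected one-step update points toward it, so the sketch only shows the mechanics of the rule. It is not one of the failure cases the abstract refers to.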

Related resources

The effect of emotion regulation training in decreasing emotion failures and self-injurious behaviors among students suffering from specific learning disorder (SLD)

Background: A great deal of attention has been given to the study of learning disorders. Hence, the aim of this research was to study the effect of emotion regulation training in decreasing emotion failures and self-injurious behaviors among students suffering from specific learning disorder. Methods: This was an experimental study with a pre-test, a post-test, and a control group. Research p...

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide a mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

Wavelet Neural Network with Random Wavelet Function Parameters

The training algorithm of Wavelet Neural Networks (WNN) is a bottleneck which impacts on the accuracy of the final WNN model. Several methods have been proposed for training the WNNs. From the perspective of our research, most of these algorithms are iterative and need to adjust all the parameters of WNN. This paper proposes a one-step learning method which changes the weights between hidden la...

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation is an influential climate component in water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features

Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...

Publication date: 2001
